Robust SVD Method for Missing Value Estimation of DNA Microarrays

نویسندگان

  • Fen Qin
  • Joseph Collins
  • Jeonghwa Lee
چکیده

A majority of DNA microarray datasets contain missing or corrupt values and it is critical to estimate these values accurately. These missing values are most often attributed to insufficient experimental resolution or the presence of foreign objects on the experimental slide’s surface. To improve existing missing value estimation algorithms, this paper introduces and investigates the scalable singular value decomposition (SSVD) solver, which is an improvement upon the Jacobi singular value decomposition (SVD) approach. Experiments were conducted on a study comparing SSVD to the Jacobi and QR SVD methods against several legitimate microarray datasets. The robustness of SSVD is verified by subjecting it to several test cases containing 1–20% of missing values. For nearly all of the test cases across all configurations of missing value percentages, SSVD provides more accurate recovery results than Jacobi and SQ SVD. These numerical results strongly suggest SSVD is a robust and scalable solver.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Missing value estimation methods for DNA microarrays

MOTIVATION Gene expression microarray experiments can generate data sets with multiple missing expression values. Unfortunately, many algorithms for gene expression analysis require a complete matrix of gene array values as input. For example, methods such as hierarchical clustering and K-means clustering are not robust to missing data, and may lose effectiveness even with a few missing values....

متن کامل

Collateral Missing Value Estimation: Robust Missing Value Estimation for Consequent Microarray Data Processing

Microarrays have unique ability to probe thousands of genes at a time that makes it a useful tool for variety of applications, ranging from diagnosis to drug discovery. However, data generated by microarrays often contains multiple missing gene expressions that affect the subsequent analysis, as most of the times these missing values are ignored. In this paper we have analyzed how accurate esti...

متن کامل

Adjustable Robust Singular Value Decomposition: Design, Analysis and Application to Finance

The Singular Value Decomposition (SVD) is a fundamental algorithm used to understand the structure of data by providing insight into the relationship between the row and column factors. SVD aims to approximate a rectangular data matrix, given some rank restriction, especially lower rank approximation. In practical data analysis, however, outliers and missing values maybe exist that restrict the...

متن کامل

Heuristic Non Parametric Collateral Missing Value Imputation: A Step Towards Robust Post-genomic Knowledge Discovery

Microarrays are able to measure the patterns of expression of thousands of genes in a genome to give profiles that facilitate much faster analysis of biological processes for diagnosis, prognosis and tailored drug discovery. Microarrays, however, commonly have missing values which can result in erroneous downstream analysis. To impute these missing values, various algorithms have been proposed ...

متن کامل

A Simultaneous Reconstruction of Missing Data in DNA Microarrays

We suggest here a new method of the estimation of missing entries in a gene expression matrix, which is done simultaneously— i.e., the estimation of one missing entry influences the estimation of other entries. Our method is closely related to the methods and techniques used for solving inverse eigenvalue problems. 2000 Mathematical Subject Classification: 15A18, 92D10

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2011